Artificial duplicate reads in sequencing data of 454 Genome Sequencer FLX System.

نویسندگان

  • Hui Dong
  • Yangyi Chen
  • Yan Shen
  • Shengyue Wang
  • Guoping Zhao
  • Weirong Jin
چکیده

The 454 Genome Sequencer (GS) FLX System is one of the next-generation sequencing systems featured by long reads, high accuracy, and ultra-high throughput. Based on the mechanism of emulsion PCR, a unique DNA template would only generate a unique sequence read after being amplified and sequenced on GS FLX. However, biased amplification of DNA templates might occur in the process of emulsion PCR, which results in production of artificial duplicate reads. Under the condition that each DNA template is unique to another, 3.49%-18.14% of total reads in GS FLX-sequencing data were found to be artificial duplicate reads. These duplicate reads may lead to misunderstanding of sequencing data and special attention should be paid to the potential biases they introduced to the data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Short Communication Artificial duplicate reads in sequencing data of 454 Genome Sequencer FLX System

Hui Dong1, Yangyi Chen2, Yan Shen2, Shengyue Wang1, Guoping Zhao1,2*, and Weirong Jin1,2* Chinese National Human Genome Center at Shanghai, Shanghai 201203, China National Engineering Center for Biochip at Shanghai, Shanghai 201203, China *Correspondence address. Tel: þ86-21-51320288 (G.Z.)/þ86-21-51320298 (W.J.); Fax: þ86-21-51320288 (G.Z.)/þ86-21-51320298 (W.J.); E-mail: [email protected] (G....

متن کامل

De novo assembly and genomic structural variation analysis with genome sequencer FLX 3K long-tag paired end reads.

The Genome Sequencer FLX System from Roche and 454 Life SciencesTM is a versatile sequencing platform suitable for a wide range of applications, including de novo sequencing and assembly of genomic DNA, transcriptome sequencing, metagenomics analysis, and amplicon sequencing. The Genome Sequencer FLX enables long sequence reads separated by kilobase distances of genomic DNA. These Long-Tag Pair...

متن کامل

Comprehensive transcriptome analysis with the Genome Sequencer FLX System

Protein isoforms make the transcriptome complex and challenging to resolve. Different forms of a protein may be produced from related genes or may arise from the same gene by alternative splicing. With a combination of long (400–500 bp) 454 SequencingTM reads, dedicated GS De Novo Assembler software and a straightforward protocol using just 200 ng of RNA as sample input, the Genome Sequencer FL...

متن کامل

Comparison of Sequence Reads Obtained from Three Next-Generation Sequencing Platforms

Next-generation sequencing technologies enable the rapid cost-effective production of sequence data. To evaluate the performance of these sequencing technologies, investigation of the quality of sequence reads obtained from these methods is important. In this study, we analyzed the quality of sequence reads and SNP detection performance using three commercially available next-generation sequenc...

متن کامل

Simple, sensitive, and swift sequencing of complete H5N1 avian influenza virus genomes.

The spread of highly pathogenic avian influenza A virus (HPAIV) of subtype H5N1 demands fast and reliable methods for in-depth, full-length sequence analysis. For this purpose, we designed a simple and sensitive method for the preparation of sequencing libraries from H5N1 HPAIV diagnostic RNA samples for sequencing with the Genome Sequencer FLX instrument. The method presented seamlessly integr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Acta biochimica et biophysica Sinica

دوره 43 6  شماره 

صفحات  -

تاریخ انتشار 2011